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Abstract 

We extend our previous study of scaling range properties done for detrended 
fluctuation analysis (DFA) [IJ to other techniques of fluctuation analysis (FA). The 
new technique called Modified Detrended Moving Average Analysis (MDMA) is in- 
troduced and its scaling range properties are examined and compared with those 
of detrended moving average analysis (DMA) and DFA. It is shown that contrary 
to DFA, DMA and MDMA techniques exhibit power law dependence of the scal- 
ing range with respect to the length of the searched signal and with respect to the 
accuracy E? of the fit to the considered scaling law imposed by DMA or MDMA 
schemes. This power law dependence is satisfied for both uncorrelated and auto- 
correlated data. We find also a simple generalization of this power law relation 
for series with different level of autocorrelations measured in terms of the Hurst 
exponent. Basic relations between scaling ranges for different techniques are also 
discussed. Our findings should be particularly useful for local FA in e.g., econo- 
physics, finances or physiology, where the huge number of short time series has to 
be examined at once and wherever the preliminary check of the scaling range regime 
for each of the series separately is neither effective nor possible. 
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1 Introduction and motivation. 



The presence of long memory effects in complex systems has been studied by many authors 
in variety of contexts and in various areas of science. Complexity of the system can be 
translated into properties of data series exhausted from the system. Therefore, a lot of 
information on complexity can be drawn from the precise study of memory effects in data. 
The latter effects are most easily searched in stationary time series Xj, {i = 1,...,L), by 
two point autocorrelation function Cg = (AxjAxj+s), where () is the average taken from 
all data in the signal with series increments Asj = Xj+i —Xi. The autocorrelation function 
changes with time lag s according to the power law [21 [3]: 

C.^(2-7)(l-7)s-^ (1) 

where the autocorrelation scaling exponent 7 (0 < 7 < 1) stands for the level of memory 
in signal and its two edge values correspond respectively to fully correlated (7 = 0) or 
completely uncorrected (7 = 1) data. 

The estimation of scaling exponent 7 can be done using various methods. The most 
popular are based on the so-called fluctuation analysis (FA) [1]. In the FA approach the 
sequence Axi is treated as steps of a discrete random walk, i.e., one builds the cumulated 
series Xt = {t = then the variance of its displacement is found by 

averaging over different time windows of length t. The scaling exponent H of the series, 
called also the Hurst exponent E] can be measured because of power law relation: 

var{xt) = {xD - {xtf ~ t'^, (2) 

where war() is the variance. The scaling exponent H is related to the autocorrelation 
exponent 7 by the linear relationship [7]: 

^ = 1 - I (3) 

One has to keep in mind that a direct calculation of correlation functions and 7 
exponent is hindered by the level of noise present in time series and by possible non- 
stationarities in the data. To reduce these effects one usually does not calculate 7 directly, 
but study instead the integrated profile of the data, i.e. Xt instead of Axj series. This 
approach makes the fundament of all FA methods. 

One should also remember that time series Xt has to be detrended before using Eq.(2), 
otherwise an artificial bias is introduced leading to incorrect values of the scaling expo- 
nents. The so-called detrendization procedure, i.e. subtraction of trend in given data, 
was first proposed in detrended fluctuation analysis (DFA) [H [9l [10] and then in detrended 
moving average analysis (DMA) pT| [T2| [T3| HI] . In the case of DFA, the trend is defined as 
the polynomial filll] to data in the considered time window of length r. The mean-square 
fluctuation F^{t) of the signal around its trend (detrended signal) is calculated and then 
F'^{t) is averaged over all time windows of length r. 

^usually the linear fit is sufficient 
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One expects, according to Eq.(2), that the power law 

{FlFA{r)),o. - r^^ (4) 

is fulfilled, where {.)hox is the expectation value, i.e. the average taken over all time 
windows of fixed size. The above equation provides one of the most frequently used 
methods in stationary and even nonstationary series to calculate their characteristic main 
H exponent and the autocorrelation properties induced by Eq.(l). 

It was examined in our former article [1] that in the case of DFA the scaling range 
(i.e. the range of r values satisfying Eq.(4)) is a linear function of data length L and the 
goodness of line fit B? in log- log scale to Eq.(4). More precisely, the formula 

Xdfa{,u,L,H) = {{aH + I3)u + aQ)L + -i (5) 

with u = \ — B? was found for DFA scaling range {Xdfa) and the coefficients a, /3, 7 
were estimated numerically (see Ref.[I] for details). 

In DMA, contrary to DFA, a trend is found as the moving average of the assumed 
length A and is calculated from data points immediately preceding the given one, say Xi. 
It means that statistics of data points entering the calculation of detrended signal depends 
strongly on the chosen length A of the moving average. For given A only data points Xi 
with i> \ can be taken into account for detrended fluctuation according to [TT] 

1 ^ 

i=\ 

where {xi)\ is the moving average of length A defined as 

1 ' 

= J^^k (7) 

k=i-X+l 

and plays the role of a trend in data series. 

The power law similar to DFA is expected pT| [T2] : 

F'omaW - (8) 

The limitations in statistics imposed by DMA can easily be skipped in a simple and 
straightforward way, quite acceptable in practical applications. Such modification of 
DMA, called by us modified DMA (MDMA) is presented in the next section. Then we 
analyze the scaling range properties of these two methods and compare them with DFA 
results. Similarly to our previous approach [1], the most interesting case to examine is 
the scaling range property for short series what is particularly useful for local FA, e.g. 
in econophysics and finances [l5l[T6l[I71[T8l[l9l|20l[2T] or physiology [221 ES]. Often a 
huge amount of short time series has to be examined at once in these applications and the 
preliminary check of the scaling range regime for each of the series separately is neither 
possible nor effective. 
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2 Introducing modified detrended moving average 
analysis (MDMA) 

DMA has a diversified statistics for detrended data entering Eq.(6) witli various lengtlis 
A of moving averages. For tlie particular clioice of A and for the series length L > X only 
L — A + 1 detrended data points enter the mean-square fluctuation Fj)j^j^ in Eq.(6). Hence, 
the more reliably a trend is determined within DMA, the weaker statistics is available 
for estimation of the scaling exponent H. Obviously, it is very uncomfortable situation. 
Especially, it leads to shorter scaling regimes for discussed scaling power laws for larger 
A valueqj. This obstacle may be rather easy eliminated in the new proposal explained 
below. 

One usually deals in practise with time series where, for various reasons, only a part of 
data is taken for further processing. In finance for instance, we investigate series of data 
starting at some moment to and terminating, say, at to + Often, it does not mean that 
earlier data for t < to are not available. We do not explore them but they usually exist 
and can be used as the background to calculate the necessary trends (moving averages) 
if one wants to. Therefore, we propose to modify slightly the scheme calculating -F|)jv/a 
in Eq.(6). The new version called MDMA assumes that some amount of data is stored 
before the basic time series {xi} {i = 1, ...,L). The whole available amount of data can 
therefore be written as the sequence {x-x^^^, x_2, X-i, xi, X2, xl}, where Xmax is the 
maximal used scaling range to be determined in this article. Such an approach enables 
to calculate trend (moving averages) for those data points where DMA procedure with 
particular choice of A simply fails. To be precise, Eq.(6) is replaced within MDMA by 

1 L 

FlIDMAW = YJ2^^^-{^^)x)' (9) 
i=l 

where the moving average of length A is modified to: 

— 1 * 

fc=i-A+l,A;^0 

and A; < indicates the sum running over additional data preceding the basic series. 
The power law in Eq.(8) is still kept when -Fm£)ma(-^) is substituted for F^n^j^{\): 

FIidmaW-^''' (11) 

It is easily seen just from the construction recipe that MDMA produces the same 
scaling exponents as DMA but with bigger accuracy due to larger scaling range available 
for calculations. We will refer to this issue in the following sections. 

^scaling properties for small A are also inappropriate because moving averages with small A do not 
reconstruct real trends 
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3 DMA and MDMA scaling ranges for uncorrelated 
and autocorrelated data 



We begin with statistical analysis of an ensemble of artificially generated uncorrelated 
time series with the given length L. Then we find in the same way as in Ref. |l] the 
percentage rate of series which are below the specified level of regression line fit in 
log-log scale, induced by Eq.(8) in the case of DMA and by Eq.(9) in the case of MDMA, 
assuming that the maximal length of moving average Xmax is being fixed. Fig. 1 illustrates 
the rejection rate, i.e. the percentage rate of series not matching the assumed criterion for 
i?^, calculated for the series of uncorrelated {H = 1/2) data of length L = 10^ for different 

values and for running Xmax- DMA and MDMA cases are shown for comparison to 
indicate that the scaling range for MDMA is bigger than for DMA, assuming the same 
requirements of the goodness fit R^. 

In further analysis we took two specific rejection thresholds: 2.5% and 5% corre- 
sponding to confidence levels (CL): 97.5% and 95% respectively. All data have been 
gathered from the ensemble of 5 x 10^ generated time series with a length varying be- 
tween L = 5 X 10^ and L = 10^. 

Let us introduce new parameter u = 1 — R"^ and investigate the X{u,L) dependence 
for DMA and MDMA methods in the same way as we did before for DFA in Ref. We 
have made throughout this paper the lower limit restriction for the minimal length of used 
moving averages (Xmin = 8) because below this threshold a significant lack of scaling in 
DMA is observed □. All values were taken from the analysis of dependencies like in Fig. 1. 
The functional dependence X{u,L) is not linear contrary to previous findings for DFA. 
Therefore we used log-log plots first to verify if it has power law origin. This hypothesis 
turned out to be very fruitful. Figs. 2a, 3a convince that the scaling range X{u,L) for 
DMA and MDMA methods factorize as 

\og{X{u,L)) = A{u) log L + B{u) (12) 

where, according to Figs. 2b, 3b, A{u) and B{u) are also linear in log(u). 
Hence, we expect the precise form of Eq.(12): 

log(A(u, L)) = (alogM + ao) logL + bhgu + bo (13) 

with some real constants a, ao, b, bo- 

The more detailed exploration of A{u) and B{u) dependencies (see Fig. 4a,b) provides 
arguments for just three free parameters entering the fit of Eq.(13), since a = for both 
considered confidence levels. Therefore, one arrives with the final power law formula 
linking the scaling range X{u, L) of uncorrelated time series with u and L for DMA and 
MDMA: 

A(m,L)) = DLV (14) 

^ shorter moving averages do not determine the local trend precisely what causes the appearance of 
artificial autocorrelations in detrended data and makes the similar effect as too small time window boxes 
in DFA 
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Table 1: Results of the best fit for coefficients in Eq.(14) describing scaling range for DMA. 
The fit is done for series with various autocorrelation level measured by H exponent and 
for chosen two confidence levels CL: 97.5% and 95%. 
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Table 2: Results of the best fit for coefficients in Eq.(14) describing scaling range for 
MDMA. Same notation applies as in Table 1. 



where D, t] and ^ are constants related in obvious manner with oq, b and 6o- 

We have checked that the same qualitative scenario is fulfilled also for long range 
correlated series {H > 1/2). Our examination was limited to series with 0.5 < H < 0.9 
which are most often met in practise in variety of areas. Figs. 5,6 show the extension 
of findings from Figs. 2,3 to cases of signals with long memory. The exemplary plots 
revealing the A{u) and B{u) relations for the autocorrelated series (the case H = 0.8) are 
shown in Figs. 4c, 4d. An interesting challenge is to find the quantitative form of D{H), 
r]{H) and ^{H) functions which generalize Eq.(14) when applied to series with memory. 
This task is shifted to section 4. 

The best fit parameters to Eq.(14) are gathered in Table 1 for DMA and in Table 2 
for MDMA for two distinct confidence levels. We required simultaneously minimization 
of the mean relative absolute error (MAE) and the maximal relative error (ME) for each 
of the fitting points in the same way as it was formerly done by us for DFA (see [T] for 
details). The fit was based on nearly 100 data pairs {ui,Lj), where Ui = 5 x 10^^(1 + i) 
with i = 1,2,. ..,9 and Lj = 500,600,800,1000,1200,1500,1800,2400,3000,6000,10000 
respectively. 

The MAE denoted as AmaeW was specified as 

AMAEiX) = 1/A^fo) KKn^L) - K,{u,L))/K,{u,L)\ (15) 

where \ij{u,L) = X{ui,Lj) is the fitting value of Eq.(12) for particular Ui and Lj, N(^ij) 
counts different (ij) pairs and X^^^iu, L) is the respective value of scaling range simulated 
numerically from an ensemble of time series. 
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Similarly, ME marked as ^me was defined as 

/\me = max(|A^'''^(M,L) - \ij{u, L))\/ \ij{u, L)) (16) 

In the case of autocorrelated signal all data were generated by Fourier filtering method 
(FFM) algorithm [21]. 

4 Towards unified model of scaling ranges 

Finally, we will look for an unified formula containing a minimal number of free parameters 
and describing scaling ranges of both uncorrelated and autocorrelated data within DMA 
or MDMA techniques. 

All data taken from Tables 1-2, when drawn against the Hurst exponent values (see 
Fig. 7), prove the existence of simple linear relations 

_ ^ ^^^^ 

^M^^M_^M^ (19) 

where m = 1, 2 corresponds respectively to DMA and MDMA methods . 

Note that D^^^ = 0, while D^^^ ^ for all confidence levels. This simplifies the unified 
formula for scaling ranges in MDMA method 

Xmoma{u, L) = Df^HW'^'-^'^'^u^''^'^'^ (20) 

It contains only 5 free parameters, once the respective unified model for scaling ranges in 
DMA contains one parameter more {D^q^): 

XoMAiu, L) = (D« + D^^^H)L^'o'-^'^'^u^^''-^'^''^ (21) 

The values of these parameters, taken directly from the regression line fit of Figs. 7a- 7f, 
define the global fit to Eqs.(20)(21). They are specified in Table 3 for DMA and in Table 
4 for MDMA, together with corresponding MAE and ME values. 

(2,) 

Let us notice that the fitting parameters for Xmdma except Di coincide for both 
confidence levels. This simplifies scaling range calculations for the newly introduced 
scheme. 
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Table 3: Results of the best fit for coefficients in unified formula in Eq.(21) for DMA 
done for all data coming from series with various autocorrelation levels and for two chosen 
confidence levels CL: 97.5% and 95%. 
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Table 4: Results of the best fit for coefficients in Eq.(20) describing scaling ranges for 
MDMA. 

5 Final remarks, discussion and conclusions. 

In this study we searched for the scaling range properties of DMA technique which links 
fiuctuations of detrended random walk -F^(A) with the length of the moving average A 
taken to subtract the trend in fiuctuations. Our simulations have been made on the 
ensemble of 5 x 10^ short and medium-length time series of length 5 x 10^ < L < 10^ 
with varying level of long memory imposed by Hurst exponent 0.5 < H < 0.8. The latter 
specific spread for H values was used to refer to cases of long range autocorrelated data 
one may most often encounter in practise. We introduced also slightly modified version 
of DMA, called by us MDMA, and examined their scaling properties in comparison with 
DMA. 

It turns out that scaling ranges for DMA reveal a simple power law relationship with 
the length of data L and the goodness of linear fit for the fundamental equation 
(Eq.(8)) which links the Hurst exponent H of time series with detrended fiuctuations of 
a signal -F^(A) around its local trend. We found also that the similar relation is fulfilled 
for the newly proposed in this paper fiuctuation technique called MDMA. A numerical fit 
to parameters describing the power law for scaling ranges in both methods was done. 

It is evident from Eqs.(20)(21) and Tables 3-4 that the scaling ranges Xdma of DMA 
and \mdma of MDMA satisfy Xdma < Xmdma for the same fixed length of data and 
the same goodness of fit (see Figs. 8,9). More precisely, Xmdma is around 10% — 20% 
larger than Xdma for uncorrelated data and 40% — 50% larger than Xdma for highly 
autocorrelated data. 

It is also remarkable that the typical scaling range in DMA for ~ 0.98 is surprisingly 
short and does not exceed 10% of the series length. It grows up to ~ 20% of series length 
if we loosen the requirements for goodness of fit to Eq.(8) down to R^ ~ 0.95. 

One may ask about a relationship between scaling ranges of DMA, MDMA and DFA 
- the latter one considered in [T]. The summarized relationships between Xdma, Xmdma 
and Xdfa based on Eqs. (5) (20) (21) are specified in Figs. 8, 9. We notice that the mutual 
hierarchy between Xdfa, Xdma and Xmdma is ambiguous and strongly depends on the 
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length of considered data L, the assumed goodness of fit (or u parameter) and the level 
of autocorrelations in a signal. We see that \dfa dominates over \dma and Xmdma for 
longer (L > 3 x 10'^) uncorrelated series of data, while for strongly autocorrelated signals 
one obtains Xdfa < Xdma < Xmdma independently on the data length. 

Finally we considered contours in (u, L) plane representing solutions of the equality 
Xdma{u, L, H) = \£)fa{u,L,H) for fixed values of H exponent. The similar analy- 
sis is repeated for solutions of Xmdma{u, L, H) = \dfa{u, L, H). They are drawn in 
Figs. 10a, 10c and in Figs. 10b, lOd respectively. Each of these contours divides {u,L) 
plane into two regions: top right region above the corresponding curve where the scaling 
range for DFA is larger than for DMA, and the bottom left area where it is on the oppo- 
site (see Figs. 10a, 10c). The similar graphic representation for Xmdma vs Xdfa scaling 
ranges as a function of L, u and H is drawn in Figs. 10b, lOd. The absence of contours for 
MDMA for H = 0.7 and H = 0.8 in Fig. 10b {CL = 97.5%) and for H = 0.6, H = 0.7, 
H = 0.8 in Fig. lOd {CL = 95%) indicates that for these autocorrelated series the scahng 
range for MDMA is always larger than the one for DFA . The top points terminating all 
contours show the maximal length of consecutive autocorrelated time series for which an 
intersection of scaling ranges between DFA and DMA or DFA and MDMA may occur 
(see also Figs. 8,9). For longer series with the particular level of autocorrelations Xdma 
(or Xmdma) always exceeds Xdfa for any chosen u value. 

All the revealed relations are believed to make an useful tool in determination of 
scaling ranges, especially if there is a need to consider large data sets arranged in a big 
number of shorter time subseries, e.g. in a search for evolving (time-dependent) local 
Hurst exponent in various areas and applications. 

It is worth to discuss the efficiency of all three methods in a proper determination of 
H scaling exponent which takes the precisely determined scaling ranges into account. It 
is beyond the scope of this article and will be a subject of forthcoming study [25] . 
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Figure 1: Percentage rate (%) of rejected time series as a function of scaling range A, 
and goodness of fit i?^ drawn for DMA (a) and MDMA (b) 
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Figure 2: Dependence between scaling range A and total length of data L (a) and goodness 
of fit u = 1 — (b) in log-log scale for DMA and MDMA at the confidence level 
CL = 97.5% 
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Figure 3: Same as in Fig. 2a, 2b but for CL = 95% 
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Figure 4: Dependence of A and B coefficients from Eq.(12) on m = 1 — R^. It is seen 
that A{u) does not depend on u. Top two panel (a), (b) correspond to uncorrelated 
series {H = 0.5), while bottom ones (c),(d) are related to strongly auto correlated data 
(H = 0.8). 
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Figure 5: Dependence between scaling range A and length of time series L for various 
levels of autocorrelation in data (measured by H exponent). Plots are drawn for particular 
choice = 0.98 and look qualitatively the same for other values (not shown). Perfect 
linear dependence logA(logL) is observed in whole domain 0.5 < H < 0.8 for both DMA 
and MDMA methods. Fitting lines are shown only for the edge values H = 0.5 and 
H = 0.8 to make all remaining plots more readable. 
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Figure 6: Same as in Fig. 5 but for A dependence on u = 1 — R"^ with fixed length of data 
L = 10^. Results for other data lengths (not shown) are qualitatively identical. 
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Figure 7: Dependence of the best fit parameters D, rj, from Eq.(14) on the Hurst 
exponent H (autocorrelation level) for DMA and MDMA methods for two confidence 
levels: CL = 97.5% and CL = 95%. Dotted lines in two top panels, making an extension 
of the fit D{H) towards H = ior MDMA, prove that of^ = (see also Eq.(17)). 
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Figure 8: Hierarchy of scaling ranges for DFA, DMA and MDMA methods given as 
a function of series data length L for uncorrelated (a) and strongly autocorrelated (b) 
signals for CL = 97.5%. Inbox in top panel shows details of this hierarchy for very short 
series (L < 3 x 10^). It is evident how the hierarchy changes with increasing H (compare 
(a) and (b) panels). Note that for uncorrelated data {H = 0.5) Xdfa > ^mdma > ^dma 
if L > 3 X 10^, while Xmdma > Xdma > Xdfa for arbitrary length of data for H = 0.8. 
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Fig. 8a. 
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Figure 10: Solutions in {u,L) plane of the relationships: Xdma{u, L, H) = Xdfa{u, L, H) 
(a,c), Xmdma{u, L, H) = Xdfa{u, L, H) (b,d) for fixed values of Hurst exponent H and 
CL. Top right part of {u, L) area above each contour corresponds to {u, L) values where 
Xdfa > Xdma (Fig. 10a, 10c) or Xdfa > Xmdma (Fig. 10b, lOd). Bottom left part of all 
figures indicates the opposite situation. 
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